Methods for Producing L-Amino Acids



Background of the Invention 

Field of the Invention 

The present invention relates, in general, to a method of producing L- 
amino acids and to a gene encoding phosphoglucoisomerase. 

Background Information 

Bacterial cells are used industrially to produce amino acids by
fermentation processes (Ishino, S. et al., J. Gen. Appl. Microbiol. 37:157-165
(1991), Kinoshita, S., Nakayama, K. and Nagasaki, S., J. Gen. Appl. Microbiol.
4:128-129 (1958)). Although numerous research reports and reviews have
appeared concerning fermentation processes and the mechanisms of accumulation
of amino acids, more progress needs to be made to increase the yields of amino
acids from microorganisms (Ishino, S. et al., J. Gen. Appl. Microbiol. 37:157-165
(1991), Aida, K. et al., eds., "Biotechnology of Amino Acid Production,"
Kodansha (Tokyo)/Elsevier (New York) (1986) and Marx, A. et al., Metabolic
Engineering 1:35-48 (1999)).

There has been some success in using metabolic engineering to direct the
flux of glucose-derived carbons toward aromatic amino acid formation (Flores,
N. et al., Nature Biotechnol. 14:620-623 (1996)). However, the successful
application in producer strains has not yet been documented (Berry, A., TIBTECH
14:250-256 (1996)).

Metabolic engineering relates to manipulation of the flow of carbons of
starting materials, such as carbohydrates and organic acids, through the variety
of metabolic pathways during fermentation. Studies have been done, for
example, on the central metabolism of Corynebacterium glutamicum using 13C
NMR (Ishino, S. et al., J. Gen. Appl. Microbiol. 37:157-165 (1991), Marx,
A. et al., Biotechnology and Bioengineering 49:111-129 (1996)). Additionally,
also using 13C NMR, Walker et al. (Walker, T. et al., J. Biol. Chem.
257:1189-1195 (1982)) analyzed glutamic acid fermentation by Microbacterium
ammoniaphilum, and Inbar et al. (Inbar, L. et al., Eur. J. Biochem. 149:601-607
(1985)) studied lysine fermentation by Brevibacterium flavum.

The present invention solves a problem of improving yields of amino 
acids during fermentation using metabolic engineering.

Summary of the Invention 

The present invention provides a method of producing L-amino acids by 
culturing altered bacterial cells having increased amounts of NADPH as 
compared to unaltered bacterial cells, whereby L-amino acid yields from said 
altered bacterial cells are greater than yields from unaltered bacterial cells.

The present invention also provides a method of producing a bacterial cell 
with a mutated phosphoglucose isomerase (pgi) gene comprising (a) subcloning 
an internal region of the pgi gene into a suicide vector; and (b) inserting said 
suicide vector into a bacterial genome, via homologous recombination, whereby 
a bacterial cell with an altered pgi gene is produced. The invention further
provides an altered bacterial cell produced according to this method.

The invention also provides a vector useful according to this method. 

The present invention further provides isolated nucleic acid molecules 
comprising a polynucleotide encoding the Corynebacterium glutamicum 
phosphoglucose isomerase polypeptide having the amino acid sequence shown
in Figure 1 (SEQ ID NO:2) or one of the amino acid sequences encoded by the
DNA clone deposited in a bacterial host as NRRL Deposit Number B-30174 on 
August 17, 1999. 

The present invention also relates to recombinant vectors, which include 
the isolated nucleic acid molecules of the present invention, and to host cells
containing the recombinant vectors, as well as to methods of making such vectors
and host cells and for using them for production of Pgi polypeptides or peptides
by recombinant techniques.

The invention further provides an isolated Pgi peptide having an amino
acid sequence encoded by a polynucleotide described herein.

Further advantages of the present invention will be clear from the 
description that follows. 

Brief Description of the Figures 

Figures 1A-1C show the nucleotide (SEQ ID NO:1) and deduced amino
acid (SEQ ID NO:2) sequences of pgi. The Pgi peptide has a deduced molecular
weight of about 59 kDa.

Detailed Description of the Invention 

It has been determined herein that increased amounts of NADPH in a
bacterial cell increase product yield, specifically in anabolic processes where 
NADPH is a limiting factor. A way of carrying chemical energy from reactions 
of catabolism to the energy-requiring reactions of biosynthesis, such as the 
formation of amino acids, is in the form of hydrogen atoms or electrons. To be 
effective as reducing agents, hydrogen atoms must have considerable free energy. 
Such high-energy hydrogen atoms are obtained from cell fuels by 
dehydrogenases, which catalyze removal of hydrogen atoms from fuel molecules 
and their transfer to specific coenzymes, particularly to the oxidized form of 
nicotinamide adenine dinucleotide phosphate (NADP+). The reduced, or
hydrogen-carrying, form of this coenzyme, designated NADPH, is a carrier of 
energy-rich electrons from catabolic reactions to electron-requiring biosynthetic 
reactions. 

The present invention provides a method for producing L-amino acids by 
culturing altered bacterial cells having increased amounts of NADPH as 
compared to unaltered bacterial cells whereby L-amino acid yields from said 
altered bacterial cells are greater than yields from unaltered bacterial cells. 
Preferred amino acids are L-lysine, L-threonine and L-isoleucine. As used herein, 
an altered bacterial cell is defined as a bacterial cell which has an increased
amount of NADPH as compared to an unaltered bacterial cell.

In one preferred embodiment, an "altered" bacterial cell is a "mutated" 
bacterial cell. A "mutation" is any detectable change in the genetic material 
which can be transmitted to daughter cells. A mutation can be any (or a 
combination of) detectable, unnatural change affecting the chemical or physical 
constitution, mutability, replication, phenotypic function, or recombination of one 
or more deoxyribonucleotides; nucleotides can be added, deleted, substituted for, 
inverted, or transposed to new positions with and without inversion. Mutations 
can occur spontaneously and can be induced experimentally by application of 
mutagens or recombinant DNA technology. A mutant variation of a nucleic acid 
molecule results from a mutation. A mutant polypeptide can result from a mutant 
nucleic acid molecule. 

Additionally, an altered or mutated bacterial cell can be genetically
"mutated" to yield an increased amount of NADPH as compared to the genetically
"unmutated" cell.

An increased amount of NADPH in the altered bacterial cell results in 
increased production of amino acids. Preferably, in an altered bacterial cell, 
amino acid yields are increased over yields from the unaltered cell from greater 
than about 1%, and preferably from about 1% to about 100%, preferably from 
about 2% to about 80%, and more preferably, from about 5% to about 60%, and 
even more preferably from about 10% to about 80%. As used herein, "yield" is 
defined as grams of amino acid produced, multiplied by 100, divided by grams 
of glucose consumed. 
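
By way of illustration only, the yield definition above can be expressed as a short Python sketch. The function names and the sample masses are hypothetical and are not taken from the specification; the second function simply expresses the increase of an altered-cell yield relative to an unaltered-cell yield as a percentage.

```python
def amino_acid_yield(grams_amino_acid: float, grams_glucose_consumed: float) -> float:
    """Yield as defined herein: grams of amino acid x 100 / grams of glucose consumed."""
    if grams_glucose_consumed <= 0:
        raise ValueError("glucose consumed must be positive")
    return grams_amino_acid * 100.0 / grams_glucose_consumed


def relative_yield_increase(altered_yield: float, unaltered_yield: float) -> float:
    """Percent increase of the altered-cell yield over the unaltered-cell yield."""
    if unaltered_yield <= 0:
        raise ValueError("unaltered yield must be positive")
    return (altered_yield - unaltered_yield) * 100.0 / unaltered_yield


# Hypothetical masses, for illustration only.
unaltered = amino_acid_yield(30.0, 100.0)   # unaltered cell
altered = amino_acid_yield(33.0, 100.0)     # altered cell
increase = relative_yield_increase(altered, unaltered)
```

With these hypothetical masses, the altered cell's yield exceeds the unaltered cell's yield by roughly 10%, which would fall inside the ranges recited above.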

In accordance with the present invention, the altered bacterial cell of the
present invention is cultured in a culture medium that comprises a carbon source 
and a nitrogen source. The carbon source can be, for example, arabinose, 
cellobiose, fructose, glucose, lactose, maltose, mannose, rhamnose, raffinose, 
sorbose, sucrose, trehalose, pyruvate, or succinate. The carbon source is 
preferably at an initial concentration of 0.1 to 10%, preferably 0.5 to 6.0% by 
weight. All of the carbon source can be added to the medium before the start of
culturing, or it can be added step by step or continuously during culturing.

The medium used herein can be solid or liquid, synthetic (i.e. man-made) 
or natural, and contains sufficient nutrients for the cultivation of the altered 
bacterial cell of the present invention. Preferably, the medium employed is a 
liquid medium, more preferably a synthetic liquid medium. 

The natural or synthetic culture media used in the above and below 
described embodiments of the invention also contain a nitrogen source, suitable 
inorganic salts, and, as appropriate, various trace nutrients, growth factors and the 
like suitable for cultivation of the altered bacterial cell, and can also contain at 
least one supplementary carbon source. The amount of each of these additional 
ingredients to be employed is preferably selected to maximize amino acid 
production. Such amounts can be determined empirically by one skilled in the art 
according to the various methods and techniques known in the art. 

Illustrative examples of suitable supplemental carbon sources include, but 
are not limited to: other carbohydrates, such as glucose, fructose, sucrose, starch 
or starch hydrolysate, cellulose hydrolysate and molasses; organic acids, such as 
acetic acid, propionic acid, lactic acid, formic acid, malic acid, citric acid, and 
fumaric acid; and alcohols, such as glycerol, inositol, mannitol and sorbitol.

Illustrative examples of suitable nitrogen sources include, but are not 
limited to: ammonia, including ammonia gas and aqueous ammonia; ammonium 
salts of inorganic or organic acids, such as ammonium chloride, ammonium 
nitrate, ammonium phosphate, ammonium sulfate and ammonium acetate; urea; 
nitrate or nitrite salts, and other nitrogen-containing materials, including amino 
acids as either pure or crude preparations, meat extract, peptone, fish meal, fish 
hydrolysate, corn steep liquor, casein hydrolysate, soybean cake hydrolysate, yeast 
extract, dried yeast, ethanol-yeast distillate, soybean flour, cottonseed meal, and 
the like. 

Illustrative examples of suitable inorganic salts include, but are not 
limited to: salts of potassium, calcium, sodium, magnesium, manganese, iron, 
cobalt, zinc, copper, molybdenum, tungsten and other trace elements, and
phosphoric acid. 

Illustrative examples of appropriate trace nutrients, growth factors, and the
like include, but are not limited to: coenzyme A, pantothenic acid,
pyridoxine-HCl, biotin, thiamine, riboflavin, flavine mononucleotide, flavine adenine
dinucleotide, DL-6,8-thioctic acid, folic acid, Vitamin B12, other vitamins, bases
such as adenine, uracil, guanine, thymine and cytosine, L-amino acids, sodium
thiosulfate, p- or r-aminobenzoic acid, niacinamide, nitriloacetate, and the like,
either as pure or partially purified chemical compounds or as present in natural
materials. Cultivation of the inventive microorganism strain can be accomplished
using any of the submerged fermentation techniques known to those skilled in the
art, such as airlift, traditional sparged-agitated designs, or in shaking culture.

The culture conditions employed, including temperature, pH, aeration rate, 
agitation rate, culture duration, and the like, can be determined empirically by one 
skilled in the art to maximize amino acid production. The selection of specific 
culture conditions depends upon factors such as medium composition and type, 
culture technique, and similar considerations. 

After cultivation for a sufficient period of time, one or more kinds
of amino acids that have accumulated in the cells and/or culture broth can be
isolated according to any of the known methods including ion exchange
chromatography, gel filtration, solvent extraction, affinity chromatography, or any
combination thereof. Any method that is compatible with the conditions employed
for cultivation can be used.

Preferred bacterial cells are Corynebacterial species and Escherichia coli. 
Preferred among bacterial cells are Corynebacterium glutamicum cells. As used 
herein, Brevibacterium flavum and Brevibacterium lactofermentum are
synonymous with Corynebacterium glutamicum. 

In the present invention, in general, increased NADPH within a 
microorganism is achieved by altering the carbon flux distribution between the 
glycolytic and pentose phosphate pathways of that organism. As used herein,
"carbon flux" refers to the number of glucose molecules which proceed down a
particular metabolic path relative to competing paths. 

Preferably, NADPH availability is increased by increasing the carbon flux 
through the oxidative branch of the pentose phosphate pathway. Theoretically, 
12 NADPH's are generated per glucose when glucose is exclusively metabolized
in the pentose phosphate pathway, but only two NADPH's are produced per
glucose metabolized in the TCA cycle (tricarboxylic acid, also called the citric
acid cycle). Ishino, S. et al., J. Gen. Appl. Microbiol. 37:157-165 (1991). The
present invention provides a method of producing L-amino acids by culturing an 
altered bacterial cell which has an increase in the carbon flux through the pentose 
phosphate pathway. 
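
By way of illustration only, the two theoretical figures above (12 NADPH per glucose through the pentose phosphate pathway, 2 per glucose through the TCA cycle) can be combined into a rough estimate for intermediate flux splits. The linear blending of the two limits and the function name are assumptions made for this sketch; the specification states only the two end points.

```python
NADPH_PER_GLUCOSE_PPP = 12.0  # glucose metabolized exclusively in the pentose phosphate pathway
NADPH_PER_GLUCOSE_TCA = 2.0   # glucose metabolized in the TCA cycle


def estimated_nadph_per_glucose(ppp_fraction: float) -> float:
    """Estimate NADPH per glucose for a given fraction of carbon flux through the PPP.

    Assumption (not from the specification): the two theoretical limits
    blend linearly with the flux fraction.
    """
    if not 0.0 <= ppp_fraction <= 1.0:
        raise ValueError("ppp_fraction must lie between 0 and 1")
    return (ppp_fraction * NADPH_PER_GLUCOSE_PPP
            + (1.0 - ppp_fraction) * NADPH_PER_GLUCOSE_TCA)
```

Under this simplification, any shift of flux from glycolysis toward the pentose phosphate pathway raises the estimated NADPH supply per glucose, which is the effect the altered bacterial cell is intended to exploit.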

Most of the glucose catabolized in living organisms proceeds through 
glycolysis resulting in the formation of pyruvate. The pentose phosphate 
pathway, also called the hexose monophosphate shunt, is an alternative route for 
glucose catabolism. The pentose phosphate pathway produces NADPH and 
under lysine fermentation conditions is more active. Ishino, S. et al., J. Gen.
Appl. Microbiol. 37:157-165 (1991).

In the present invention, an altered bacterial cell can be one in which
carbon flux through the oxidative branch of the pentose phosphate pathway is
increased. Specifically, in the present invention, an altered bacterial cell can be
one which has an increased amount of one or more enzymes involved in the
pentose phosphate pathway. Such pentose phosphate enzymes are selected from
the group comprising glucose 6-phosphate dehydrogenase, transketolase,
transaldolase, ribulose 5-phosphate-3-epimerase, ribulose 5-phosphate isomerase,
6-phosphogluconate dehydrogenase, and 6-phosphogluconolactonase.

In a preferred embodiment, the present invention further provides a 
method of producing L-amino acids by culturing an altered bacterial cell with an 
increased amount of malic enzyme relative to an unaltered cell. Malic enzyme 
catalyzes the reaction of malate with NADP+ to produce pyruvate, carbon dioxide,
NADPH and H+.

In a preferred embodiment, the present invention further provides a
method of producing L-amino acids by culturing an altered bacterial cell with an
increased amount of isocitrate dehydrogenase relative to an unaltered cell.
Isocitrate dehydrogenase catalyzes the reaction of isocitrate with NADP+ to
produce α-ketoglutarate, carbon dioxide, NADPH and H+.

Glycolysis and the pentose phosphate pathway compete for glucose.
In the present invention, an altered bacterial cell can be one in which a decrease
or blockage of the carbon flux through glycolysis results in an increase in the
carbon flux through the oxidative branch of the pentose phosphate pathway. As
used in the present invention, an altered bacterial cell can be one in which a
decrease in carbon flux through glycolysis is achieved through decreasing the
amount of one or more enzyme(s) involved in glycolysis. Preferred enzymes are
6-phosphoglucose isomerase, fructose diphosphate aldolase, D-glyceraldehyde
phosphate dehydrogenase, phosphoglycerate kinase, phosphoglycerate mutase,
enolase or pyruvate kinase. A preferred enzyme is 6-phosphoglucose isomerase.

A preferred method of decreasing the amount of a glycolytic enzyme in
an altered bacterial cell is by mutating the gene which encodes the enzyme. As 
used herein, preferred is blocking (null) or weakening (decreased) expression of 
the gene encoding 6-phosphoglucose isomerase ("pgi"). 

A preferred method of blocking (null) or weakening (decreased) 
expression of genes encoding enzymes involved in glycolysis is by using suicide 
vectors (also called integrative vectors). As used herein, a suicide vector is
defined as a vector which does not replicate autonomously within a particular
organism and which, when introduced into the cell, recombines into a
homologous region of the organism's chromosome to cause insertional
inactivation of the homologous gene. Insertional inactivation of the gene is
achieved by disrupting the reading frame of the gene. Insertional inactivation of 
the gene occurs only if an internal portion of the gene is used as the homologous 
region. 

Recombinant constructs can be introduced into the bacterial cells of the 
present invention using well known techniques such as transduction, transfection, 
transvection, conjugation, electroporation, electrotransformation, calcium
phosphate transfection, DEAE-dextran mediated transfection,
cationic lipid-mediated transfection, and transformation or other methods. Such
methods are described in many standard laboratory manuals, such as Davis et al.,
"Basic Methods in Molecular Biology," (1986).

In a preferred embodiment, the altered bacterial cell is produced by (a) 
subcloning an internal region of the pgi gene into a suicide vector; and (b) 
inserting said suicide vector into a bacterial genome via homologous 
recombination. An internal region can be defined as a contiguous DNA sequence 
between but not including the initiation codon and final codon of the open reading
frame (ORF) in question. Preferably an internal region is chosen which will 
facilitate genomic integration and result in the expression of a non-functional 
polypeptide from the ORF in question. 
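
By way of illustration only, the definition of an internal region given above can be sketched in Python. The toy ORF, the fragment strings, and the function names are hypothetical and do not correspond to the pgi sequence of SEQ ID NO:1; the sketch merely excludes the first and last codons of an ORF and checks whether a candidate fragment lies wholly within what remains.

```python
def internal_region(orf: str) -> str:
    """Return the ORF sequence strictly between its initiation codon and its final codon."""
    orf = orf.upper()
    if len(orf) % 3 != 0 or len(orf) < 9:
        raise ValueError("ORF must consist of whole codons and contain an internal codon")
    return orf[3:-3]  # drop the first codon and the last codon


def is_internal_fragment(fragment: str, orf: str) -> bool:
    """True if the fragment is a contiguous sequence found wholly inside the internal region."""
    fragment = fragment.upper()
    return len(fragment) > 0 and fragment in internal_region(orf)


# Hypothetical 6-codon ORF, for illustration only.
toy_orf = "ATGGCTAAGCTTGGATAA"
inner = internal_region(toy_orf)
```

A fragment that includes the initiation codon (here "ATGGCT") is rejected, consistent with the statement above that insertional inactivation requires an internal portion of the gene as the homologous region.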

In certain preferred embodiments, the suicide vectors can be inducible, 
mutant-specific and/or condition-specific. Particularly preferred among such 
vectors are those inducible by environmental factors that are easy to manipulate, 
such as temperature and nutrient additives. Other suitable environmental factors 
will be readily apparent to the skilled artisan. 

The altered bacterial cells of the present invention can be transformed 
with suicide vectors which optionally include at least one marker gene. Such 
markers include amikacin, augmentin (amoxicillin plus clavulanic acid),
ampicillin, cefazolin, cefoxitin, ceftazidime, ceftiofur, cephalothin,
chloramphenicol, enrofloxacin, erythromycin, florfenicol, gentamicin, imipenem,
kanamycin, sarafloxacin, tetracycline, ticarcillin, streptomycin, spectinomycin,
hygromycin, trimethoprim or tilmicosin resistance genes. Preferred markers 
include chloramphenicol and/or kanamycin resistance genes. Other suitable 
markers will be readily apparent to the skilled artisan. 

An illustrative example of the use of suicide vectors is as follows: an 
internal region of a gene is amplified via the polymerase chain reaction, and the 
fragment resulting from the amplification is subcloned into a suicide vector which 
includes an antibiotic resistance marker gene, and the suicide vector is 
transformed into the original organism. The recovery of antibiotic resistant
clones implies insertional inactivation of the homologous gene. The suicide
vector used can include any plasmid incapable of autonomous replication in the
target organism. In cases where the target organism is not Escherichia coli,
ColE1-based replicons are preferred. Among ColE1-based replicons, pBGS131
(American Type Culture Collection (ATCC), Manassas, VA, Deposit No. 37443)
is preferred.

In a preferred embodiment, the present invention further provides a 
method of producing a bacterial cell with a mutated pgi gene. In a particularly 
preferred embodiment, the invention provides a method of producing a bacterial 
cell with a mutated pgi gene comprising (a) subcloning an internal region of the 
pgi gene into a suicide vector; and (b) inserting said suicide vector into a bacterial 
genome via homologous recombination whereby a bacterial cell with an altered 
pgi gene is produced. 

In a further embodiment, the present invention provides a bacterial cell 
produced according to the above-described methods. 

An illustrative example of production of an altered bacterial cell follows. 
A region of the Corynebacterium glutamicum (C. glutamicum) pgi gene, which
encodes 6-phosphoglucose isomerase (a glycolytic enzyme), is amplified by PCR
using suitable primers. Preferably, the PCR primers are those listed in SEQ ID
NO:3 and SEQ ID NO:4, which contain the recognition sequence for the
restriction enzyme HindIII. Following restriction with HindIII, the PCR product
is then subcloned into the suicide vector pBGS131. The resulting subclone is
designated pDPTpgi2. The subclone pDPTpgi2 is then transformed into C. 
glutamicum and kanamycin resistant colonies are selected for on appropriate 
media. The isolation of kanamycin resistant colonies implies that an integration 
event has occurred. Predominantly the integration occurs via homologous 
recombination resulting in disruption of the pgi gene. 

Another preferred method of producing an altered bacterial cell is by 
blocking or weakening expression of the appropriate gene through alteration of 
the promoter in front of the gene. Preferred is using a different promoter from
any source or changing the nucleotide sequence of the native promoter. Preferred
among methods of changing the nucleotide sequence of the native promoter is
PCR mutagenesis. Known bacterial promoters suitable for this use in the
present invention include the E. coli lacI and lacZ promoters, the T3 and T7
promoters, the gpt promoter, the lambda PR and PL promoters, the trp promoter,
the tac promoter and promoters endogenous to the bacterial cells of the present
invention. Also preferred is upregulation of genes encoding enzymes involved 
in the pentose phosphate pathway. This can be done by alteration of the promoter 
controlling the gene such that a stronger promoter than the native promoter is 
used. Another preferred way of upregulating the genes of the pentose phosphate 
pathway would be increasing the copy number of the genes in question through 
the use of genomic integration or autonomously replicating plasmids. 

In a preferred embodiment, the present invention also provides a method 
of producing L-amino acids comprising culturing an altered bacterial cell, 
wherein said bacterial cell is a Corynebacterium glutamicum cell with a gene 
selected from the group consisting of a mutant pgi gene. 

Another preferred method of producing an altered bacterial cell comprises 
mutating a gene which encodes an enzyme involved in glycolysis to produce 
blocked or weakened expression of the gene encoding the glycolytic enzyme. 
Illustrative examples of suitable methods for preparing mutated genes include, but 
are not limited to: PCR mutagenesis, in vitro chemical mutagenesis, 
oligonucleotide mutagenesis, mutagenesis by irradiation with ultraviolet light or 
X-rays, or by treatment with a chemical mutagen such as nitrosoguanidine
(N-methyl-N'-nitro-N-nitrosoguanidine), methylmethanesulfonate, nitrogen
mustard and the like; gene integration techniques, such as those mediated by
insertional elements or transposons or by homologous recombination of
transforming linear or circular DNA molecules; and transduction mediated by
bacteriophages such as P1. These methods are well known in the art and are
described, for example, in J.H. Miller, Experiments in Molecular Genetics, Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1972); J.H. 
Miller, A Short Course in Bacterial Genetics, Cold Spring Harbor Laboratory
Press, Cold Spring Harbor, New York (1992); M. Singer and P. Berg, Genes & 
Genomes, University Science Books, Mill Valley, California (1991); J. 
Sambrook, E.F. Fritsch and T. Maniatis, Molecular Cloning: A Laboratory 
Manual, 2d ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York (1989); P.B. Kaufman et al., Handbook of Molecular and Cellular Methods
in Biology and Medicine, CRC Press, Boca Raton, Florida (1995); Methods in 
Plant Molecular Biology and Biotechnology, B.R. Glick and J.E. Thompson, eds., 
CRC Press, Boca Raton, Florida (1993); and P.F. Smith-Keary, Molecular 
Genetics of Escherichia coli, The Guilford Press, New York, NY (1989). 

In a preferred embodiment, the present invention further provides an 
isolated or purified bacterial cell comprising a mutated pgi gene. 

The present invention further provides isolated nucleic acid molecules 
comprising a polynucleotide encoding a Pgi polypeptide having the amino acid 
sequence shown in Figure 1 (SEQ ID NO:2). The nucleotide sequence shown in 
Figure 1 (SEQ ID NO:1) can be obtained by sequencing the DNA clone, which
was deposited on August 17, 1999 at the Agricultural Research Service Culture
Collection (NRRL), 1815 North University Street, Peoria, Illinois 61604, USA,
under the terms of the Budapest Treaty and given accession number B-30174.
The deposited clone is in the p41-13(C01) plasmid.

The present invention provides an isolated nucleic acid molecule selected 
from the group consisting of: 

(a) a polynucleotide encoding a polypeptide comprising amino acids 
from about 1 to about 540 in SEQ ID NO:2; 

(b) a polynucleotide encoding a polypeptide comprising one of the 
amino acid sequences encoded by the DNA clone contained in 
NRRL Deposit No. B-30174;

(c) the complement of (a) or (b); 

(d) a polynucleotide variant created by altering the polynucleotide of
(a), wherein:

(1) said altering includes a nucleotide insertion,
deletion, or substitution, or any combination
thereof; and

(2) the number of alterations is equal to or less than
5% of the total number of nucleotides present in (a);

(e) a polynucleotide variant created by altering the polynucleotide of
(b), wherein:

(1) said altering includes a nucleotide insertion,
deletion, or substitution, or any combination
thereof; and

(2) the number of alterations is equal to or less than
5% of the total number of nucleotides present in (b);

(f) a polynucleotide variant created by altering the polynucleotide of
(c), wherein:

(1) said altering includes a nucleotide insertion,
deletion, or substitution, or any combination
thereof; and

(2) the number of alterations is equal to or less than
5% of the total number of nucleotides present in (c).
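
By way of illustration only, the 5% limit on alterations recited in (d) through (f) can be checked with a short Python sketch. The specification does not prescribe how alterations are to be counted; the sketch assumes that the minimum number of single-nucleotide insertions, deletions, and substitutions (the Levenshtein edit distance) is used. The function names and the 40-nucleotide test sequence are hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions, or substitutions turning a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion from a
                            curr[j - 1] + 1,      # insertion into a
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def within_alteration_limit(reference: str, variant: str, max_percent: int = 5) -> bool:
    """True if the variant differs from the reference by at most max_percent of the reference length."""
    # Integer arithmetic avoids floating-point rounding at the boundary.
    return edit_distance(reference, variant) * 100 <= max_percent * len(reference)


# Hypothetical 40-nucleotide reference: at most 2 alterations fit within 5%.
reference = "ACGT" * 10
one_substitution = "T" + reference[1:]
three_deletions = reference[3:]
```

Here the single-substitution variant satisfies the limit, while the variant lacking three nucleotides does not.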

The present invention further provides the above nucleic acid molecule 
wherein said polynucleotide has the complete nucleotide sequence in SEQ ID
NO:1.

The present invention further provides the above nucleic acid molecule 
wherein said polynucleotide has the nucleotide sequence in SEQ ID NO:1
encoding the Pgi polypeptide having the complete amino acid sequence in SEQ
ID NO:2.

The present invention further provides the above nucleic acid molecule 
wherein said polynucleotide has a nucleotide sequence encoding the Pgi 
polypeptide encoded by a DNA clone contained in NRRL Deposit No. B-30174.

The present invention further provides an isolated nucleic acid molecule
comprising a polynucleotide which hybridizes under stringent hybridization 
conditions to a polynucleotide having a nucleotide sequence identical to a 
nucleotide sequence in (a), (b), or (c) of the above nucleic acid molecule, 
wherein said polynucleotide which hybridizes does not hybridize under stringent 
hybridization conditions to a polynucleotide having a nucleotide sequence 
consisting of only A residues or of only T residues. 

The present invention further provides a method for making a 
recombinant vector comprising inserting the above isolated nucleic acid 
molecule into a vector. 

The present invention further provides a vector comprising the above 
nucleic acid molecule. The present invention further provides a method of 
making a recombinant host cell comprising introducing the above vector into a 
host cell. The present invention further provides a host cell comprising the 
above vector. The present invention further provides a method for producing a 
Pgi polypeptide, comprising culturing the above recombinant host cell under 
conditions such that said polypeptide is expressed and recovering said 
polypeptide. 

The present invention further provides an isolated polypeptide selected 
from the group consisting of: 

(a) a polypeptide comprising amino acids from about 1 to
about 540 in SEQ ID NO:2;

(b) a polypeptide comprising the amino acid sequence
encoded by the DNA clone contained in NRRL Deposit
No. B-30174;

(c) a polypeptide variant created by altering the amino acid 
sequence of (a), wherein: 

(1) said altering includes an insertion, deletion, or
substitution, or any combination thereof; and

(2) the number of alterations is equal to or less than 5%
of the total number of amino acids present in (a);

(d) a polypeptide variant created by altering the
polynucleotide of (b), wherein:

(1) said altering includes an insertion, deletion, or
substitution, or any combination thereof; and

(2) the number of alterations is equal to or less than 5%
of the total number of amino acids present in (b).

Nucleic Acid Molecules 

Unless otherwise indicated, all nucleotide sequences determined by 
sequencing a DNA molecule herein were determined using an automated DNA 
sequencer (such as the Model 373 from Applied Biosystems, Inc.), and all amino 
acid sequences of polypeptides encoded by DNA molecules determined herein 
were predicted by translation of a DNA sequence determined as above. 
Therefore, as is known in the art for any DNA sequence determined by this 
automated approach, any nucleotide sequence determined herein may contain 
some errors. Nucleotide sequences determined by automation are typically at 
least about 90% identical, more typically at least about 95% to at least about 
99.9% identical to the actual nucleotide sequence of the sequenced DNA 
molecule. As is also known in the art, a single insertion or deletion in a 
determined nucleotide sequence compared to the actual sequence will cause a 
frame shift in translation of the nucleotide sequence such that the predicted amino 
acid sequence encoded by a determined nucleotide sequence will be completely 
different from the amino acid sequence actually encoded by the sequenced DNA 
molecule, beginning at the point of such an insertion or deletion. 
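
By way of illustration only, the effect of a single inserted nucleotide on a predicted amino acid sequence can be shown with a short Python sketch. The two short sequences are hypothetical and are not taken from SEQ ID NO:1, and the sketch assumes the standard genetic code.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1; '*' marks a stop codon, trailing bases are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Hypothetical "actual" sequence and a "determined" sequence carrying one
# spurious C inserted after the second codon.
actual = "ATGGCTAAAGGTCTGTAA"
determined = "ATGGCT" + "C" + "AAAGGTCTGTAA"

actual_peptide = translate(actual)          # "MAKGL*"
determined_peptide = translate(determined)  # "MAQRSV"
```

The two predicted peptides agree up to the point of insertion ("MA") and differ at every position thereafter, including loss of the stop codon, as described above.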

Using the information provided herein, such as the nucleotide sequence 
in Figure 1, a nucleic acid molecule of the present invention encoding a Pgi 
polypeptide may be obtained using standard cloning and screening procedures.

Thus, the present invention provides a nucleotide sequence encoding the
Pgi polypeptide having the amino acid sequence encoded by the clone contained
in the host identified as NRRL Deposit No. B-30174 and as shown in Figure 1
(SEQ ID NOs:1 and 2).

As one of ordinary skill would appreciate, due to the possibilities of 
sequencing errors, the predicted Pgi polypeptide encoded by the deposited clone 
comprises about 540 amino acids, but may be anywhere in the range of 500 to 580
amino acids. 

As indicated, nucleic acid molecules of the present invention may be in 
the form of RNA, such as mRNA, or in the form of DNA, including, for instance, 
cDNA and genomic DNA obtained by cloning or produced synthetically. The 
DNA may be double-stranded or single-stranded. Single-stranded DNA or RNA 
may be the coding strand, also known as the sense strand, or it may be the 
non-coding strand, also referred to as the anti-sense strand. 

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, 
DNA or RNA, which has been removed from its native environment. For 
example, recombinant DNA molecules contained in a vector are considered 
isolated for the purposes of the present invention. Further examples of isolated 
DNA molecules include recombinant DNA molecules maintained in heterologous 
host cells or purified (partially or substantially) DNA molecules in solution. 
Isolated RNA molecules include in vivo or in vitro RNA transcripts of the DNA 
molecules of the present invention. Isolated nucleic acid molecules according to 
the present invention further include such molecules produced synthetically. 

Isolated nucleic acid molecules of the present invention include DNA 
molecules comprising an open reading frame (ORF) shown in Figure 1 (SEQ ID 
NO:1); DNA molecules comprising the coding sequence for the Pgi protein 
shown in Figure 1 (SEQ ID NO:2); and DNA molecules which comprise a 
sequence substantially different from those described above but which, due to the 
degeneracy of the genetic code, still encode the Pgi protein. Of course, the 
genetic code is well known in the art. Thus, it would be routine for one skilled 
in the art to generate such degenerate variants. 
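
As a minimal sketch of how such a degenerate variant might be generated, the Python below swaps each codon of a coding sequence for a synonymous codon and confirms that the encoded protein is unchanged. The input sequence is hypothetical (not SEQ ID NO:1), and the function names are assumptions chosen for illustration.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Group codons by the amino acid (or stop signal) they encode.
SYNONYMS = {}
for codon, aa in CODON_TABLE.items():
    SYNONYMS.setdefault(aa, []).append(codon)


def translate(dna):
    """Translate a DNA coding strand in frame 1, ignoring a trailing partial codon."""
    dna = dna.upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def degenerate_variant(dna):
    """Swap every codon for the next synonymous codon, where one exists."""
    dna = dna.upper()
    out = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        options = SYNONYMS[CODON_TABLE[codon]]
        out.append(options[(options.index(codon) + 1) % len(options)])
    return "".join(out)


# Hypothetical coding sequence (illustration only).
original = "ATGGCTGATGTCCACGAAGCTTTG"
variant = degenerate_variant(original)
changed = sum(a != b for a, b in zip(original, variant))

print("variant           :", variant)
print("nucleotides changed:", changed)
print("same protein       :", translate(variant) == translate(original))
```

Codons for methionine and tryptophan have no synonyms in the standard code, so they are left as they are.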

In addition, the invention provides nucleic acid molecules having 
nucleotide sequences related to extensive portions of SEQ ID NO:1. 

In another aspect, the invention provides isolated nucleic acid molecules 
encoding the Pgi polypeptide having an amino acid sequence encoded by the 
nucleic acid molecule deposited as NRRL Deposit No. B-30174 on 
August 17, 1999. The invention further provides an isolated nucleic acid 
molecule having the nucleotide sequence shown in Figure 1 (SEQ ID NO:1) or 
the nucleotide sequence of the pgi genomic sequence contained in the 
above-described deposited clone, or a nucleic acid molecule having a sequence 
complementary to one of the above sequences. Such isolated molecules, 
particularly DNA molecules, are useful as probes for gene mapping, by in situ 
hybridization with chromosomes. 

The present invention is further directed to fragments of the isolated 
nucleic acid molecules described herein. By a fragment of an isolated nucleic 
acid molecule having the nucleotide sequence of the deposited clone or the 
nucleotide sequence shown in Figure 1 (SEQ ID NO:1) is intended fragments at 
least about 15 nucleotides (nt), and more preferably at least about 20 nt, still more 
preferably at least about 30 nt, and even more preferably, at least about 40 nt in 
length which are useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50, 75, 100, 125, 150, 175, 200, 225, 250, 300, 325, 350, 
375, 400, 425, 450, 475, 500, 525, 550 nt in length are also useful according to 
the present invention as are fragments corresponding to most, if not all, of the 
nucleotide sequence of the deposited clone or as shown in Figure 1 (SEQ ID 
NO:1). By a fragment at least 20 nt in length, for example, is intended fragments 
which include 20 or more contiguous bases from the nucleotide sequence of the 
deposited clone or the nucleotide sequence as shown in Figure 1 (SEQ ID NO:1). 

In another aspect, the invention provides an isolated nucleic acid molecule 
comprising a polynucleotide which hybridizes under stringent hybridization 
conditions to a portion of the polynucleotide in a nucleic acid molecule of the 
invention described above, for instance, the deposited clone contained in NRRL 
Deposit B-30174. By "stringent hybridization conditions" is intended overnight 
incubation at 42°C in a solution comprising: 50% formamide, 5x SSC (150 mM 
NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x 
Denhardt's solution, 10% dextran sulfate, and 20 µg/ml denatured, sheared salmon 
sperm DNA, followed by washing the filters in 0.1x SSC at about 65°C. 
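
For orientation, the salt concentrations implied by an n-fold SSC solution can be worked out as follows. This sketch assumes the conventional definition in which 1x SSC is 150 mM NaCl and 15 mM trisodium citrate, that is, it assumes the figures given in parentheses above describe the 1x composition. The function name is illustrative.

```python
# Assumed composition of 1x SSC (conventional definition), in mM.
SSC_1X_MM = {"NaCl": 150.0, "trisodium citrate": 15.0}


def ssc_composition(fold):
    """Return the mM concentration of each salt in an n-fold SSC solution."""
    return {salt: fold * conc for salt, conc in SSC_1X_MM.items()}


# Hybridization buffer (5x) and the low-salt wash (0.1x) named in the text.
for fold in (5, 0.1):
    print(f"{fold}x SSC:", ssc_composition(fold))
```

Under that assumption the wash step is carried out at a fiftieth of the salt concentration used for hybridization, which is what makes the wash the more stringent step.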

By a polynucleotide which hybridizes to a "portion" of a polynucleotide 
is intended a polynucleotide (either DNA or RNA) hybridizing to at least about 
15 nucleotides (nt), and more preferably at least about 20 nt, still more preferably 
at least about 30 nt, and even more preferably about 30-70 nt of the reference 
polynucleotide. These are useful as diagnostic probes and primers as discussed 
above and in more detail below. 

By a portion of a polynucleotide of "at least 20 nt in length," for example, 
is intended 20 or more contiguous nucleotides from the nucleotide sequence of 
the reference polynucleotide (e.g., the deposited clone or the nucleotide sequence 
as shown in Figure 1 (SEQ ID NO:1)). Of course, a polynucleotide which 
hybridizes only to a polyA sequence (such as the 3' terminal poly(A) tract of the 
pgi cDNA shown in Figure 1 (SEQ ID NO:1)), or to a complementary stretch of 
T (or U) residues, would not be included in a polynucleotide of the invention used 
to hybridize to a portion of a nucleic acid of the invention, since such a 
polynucleotide would hybridize to any nucleic acid molecule containing a poly 
(A) stretch or the complement thereof (e.g., practically any double-stranded 
cDNA clone). 

As indicated, nucleic acid molecules of the present invention which 
encode a Pgi polypeptide may include, but are not limited to those encoding the 
amino acid sequence of the polypeptide, by itself; the coding sequence for the 
polypeptide and additional sequences, such as those encoding an amino acid 
leader or secretory sequence, such as a pre-, or pro- or prepro- protein sequence; 
the coding sequence of the polypeptide, with or without the aforementioned 
additional coding sequences, together with additional, non-coding sequences, 
including for example, but not limited to non-coding 5' and 3' sequences, such 
as the transcribed, non-translated sequences that play a role in transcription, 
mRNA processing, for example - ribosome binding and stability of mRNA; an 
additional coding sequence which codes for additional amino acids, such as those 
which provide additional functionalities. Thus, the sequence encoding the 
polypeptide may be fused to a marker sequence, such as a sequence encoding a 
peptide which facilitates purification of the fused polypeptide. In certain 
preferred embodiments of this aspect of the invention, the marker amino acid 
sequence is a hexa-histidine peptide, such as the tag provided in a pQE vector 
(Qiagen, Inc.), among others, many of which are commercially available. As 
described in Gentz et al., Proc. Natl. Acad. Sci. USA 86:821-824 (1989), for 
instance, hexa-histidine provides for convenient purification of the fusion protein. 
The "HA" tag is another peptide useful for purification which corresponds to an 
epitope derived from the influenza hemagglutinin protein, which has been 
described by Wilson et al., Cell 37:767 (1984). As discussed below, other such 
fusion proteins include the Pgi fused to Fc at the N- or C-terminus. 

The probes, primers, and/or nucleic acid fragments described above can 
be used to monitor expression of the pgi gene during fermentation. 

The present invention further relates to variants of the nucleic acid 
molecules of the present invention, which encode portions, analogs or derivatives 
of the Pgi protein. Variants may occur naturally, such as a natural allelic variant. 
By an "allelic variant" is intended one of several alternate forms of a gene 
occupying a given locus on a chromosome of an organism. Genes II, Lewin, B., 
ed., John Wiley & Sons, New York (1985). Non-naturally occurring variants may 
be produced using art-known mutagenesis techniques. 

Such variants include those produced by nucleotide substitutions, 
deletions or additions, which may involve one or more nucleotides. The variants 
may be altered in coding regions, non-coding regions, or both. Alterations in the 
coding regions may produce conservative or non-conservative amino acid 
substitutions, deletions or additions. Especially preferred among these are silent 
substitutions, additions and deletions, which do not alter the properties and 
activities of the Pgi protein or portions thereof. Also especially preferred in this 
regard are conservative substitutions. 

Further embodiments of the invention include isolated nucleic acid 
molecules comprising a polynucleotide having a nucleotide sequence at least 
95%, 96%, 97%, 98% or 99% identical to (a) a nucleotide sequence encoding the 
polypeptide having the amino acid sequence in SEQ ID NO:2; (b) a nucleotide 
sequence encoding the full-length Pgi polypeptide having the complete amino 
acid sequence encoded by the clone contained in NRRL Deposit No. B-30174; 
or (c) a nucleotide sequence complementary to any of the nucleotide sequences 
in (a) or (b). 

By a polynucleotide having a nucleotide sequence at least, for example, 
95% "identical" to a reference nucleotide sequence encoding a Pgi polypeptide 
is intended that the nucleotide sequence of the polynucleotide is identical to the 
reference sequence except that the polynucleotide sequence may include up to 
five point mutations per each 100 nucleotides of the reference nucleotide 
sequence encoding the Pgi polypeptide. In other words, to obtain a 
polynucleotide having a nucleotide sequence at least 95% identical to a reference 
nucleotide sequence, up to 5% of the nucleotides in the reference sequence may 
be deleted or substituted with another nucleotide, or a number of nucleotides up 
to 5% of the total nucleotides in the reference sequence may be inserted into the 
reference sequence. These mutations of the reference sequence may occur at the 
5' or 3' terminal positions of the reference nucleotide sequence or anywhere 
between those terminal positions, interspersed either individually among 
nucleotides in the reference sequence or in one or more contiguous groups within 
the reference sequence. 

As a practical matter, whether any particular nucleic acid molecule is at 
least 95%, 96%, 97%, 98% or 99% identical to, for instance, the nucleotide 
sequence shown in Figure 1 or to the nucleotide sequence of the deposited clone 
can be determined conventionally using known computer programs such as the 
Bestfit program (Wisconsin Sequence Analysis Package, Version 8 for Unix, 
Genetics Computer Group, University Research Park, 575 Science Drive, 
Madison, WI 53711). Bestfit uses the local homology algorithm of Smith and 
Waterman, Advances in Applied Mathematics 2:482-489 (1981), to find the best 
segment of homology between two sequences. When using Bestfit or any other 
sequence alignment program to determine whether a particular sequence is, for 
instance, 95% identical to a reference sequence according to the present 
invention, the parameters are set, of course, such that the percentage of identity 
is calculated over the full length of the reference nucleotide sequence and that 
gaps in homology of up to 5% of the total number of nucleotides in the reference 
sequence are allowed. 
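
The identity criterion described above can be sketched in a few lines of Python. This is not the Bestfit program and performs no alignment of its own; it assumes the two sequences have already been aligned (gaps written as "-") and simply counts matching positions over the full length of the reference. The function name and the 20-nucleotide example sequences are hypothetical.

```python
def percent_identity(aligned_ref, aligned_query):
    """Percent identity calculated over the full (ungapped) reference length."""
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("aligned sequences must have equal length")
    # Reference length excludes gap columns introduced by insertions in the query.
    ref_len = sum(1 for r in aligned_ref if r != "-")
    matches = sum(
        1 for r, q in zip(aligned_ref, aligned_query) if r == q and r != "-"
    )
    return 100.0 * matches / ref_len


# Hypothetical 20-nt reference and a query carrying one substitution.
reference = "ATGGCTGATGTCCACGAAGC"
query = "ATGGCTGATGACCACGAAGC"

print(percent_identity(reference, query))
```

One mismatch in 20 reference positions corresponds to the "five point mutations per each 100 nucleotides" limit of the 95% threshold.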

The present application is directed to nucleic acid molecules at least 95%, 
96%, 97%, 98% or 99% identical to the nucleic acid sequence shown in Figure 
1 (SEQ ID NO:1) or to the nucleic acid sequence of the deposited clone, 
irrespective of whether they encode a polypeptide having Pgi activity. This is 
because even where a particular nucleic acid molecule does not encode a 
polypeptide having Pgi activity, one of skill in the art would still know how to use 
the nucleic acid molecule, for instance, as a hybridization probe or a polymerase 
chain reaction (PCR) primer. Uses of the nucleic acid molecules of the present 
invention that do not encode a polypeptide having Pgi activity include, inter alia, 
isolating the pgi gene or allelic variants thereof in a genomic library and Northern 
Blot analysis for detecting pgi mRNA expression. 

Preferred, however, are nucleic acid molecules having sequences at least 
95%, 96%, 97%, 98% or 99% identical to the nucleic acid sequence shown in 
Figure 1 (SEQ ID NO:1) or to the nucleic acid sequence of the deposited clone 
which do, in fact, encode a polypeptide having Pgi protein activity. By "a 
polypeptide having Pgi activity" is intended polypeptides exhibiting activity 
similar, but not necessarily identical, to an activity of the Pgi protein of the 
invention, as measured in a particular biological assay. 

Of course, due to the degeneracy of the genetic code, one of ordinary skill 
in the art will immediately recognize that a large number of the nucleic acid 
molecules having a sequence at least 95%, 96%, 97%, 98%, or 99% identical to 
the nucleic acid sequence of the deposited clone or the nucleic acid sequence 
shown in Figure 1 (SEQ ID NO:1) will encode a polypeptide "having Pgi protein 
activity." In fact, since degenerate variants of these nucleotide sequences all 



WO 01/07626 PCT/US00/19914 

encode the same polypeptide, this will be clear to the skilled artisan even without 
performing the above described comparison assay. It will be further recognized 
in the art that, for such nucleic acid molecules that are not degenerate variants, a 
reasonable number will also encode a polypeptide having Pgi protein activity. 
This is because the skilled artisan is fully aware of amino acid substitutions that 
are either less likely or not likely to significantly affect protein function (e.g., 
replacing one aliphatic amino acid with a second aliphatic amino acid). 

For example, guidance concerning how to make phenotypically silent 
amino acid substitutions is provided in Bowie, J. U. et al., "Deciphering the 
Message in Protein Sequences: Tolerance to Amino Acid Substitutions," Science 
247:1306-1310 (1990), wherein the authors indicate that proteins are surprisingly 
tolerant of amino acid substitutions. 

Vectors and Host Cells 

The present invention also relates to vectors which include the isolated 
DNA molecules of the present invention, host cells which are genetically 
engineered with the recombinant vectors, and the production of Pgi polypeptides 
or fragments thereof by recombinant techniques. 

The polynucleotides may be joined to a vector containing a selectable 
marker for propagation in a host. If the vector is a virus, it may be packaged in 
vitro using an appropriate packaging cell line and then transduced into host cells. 

The DNA insert should be operatively linked to an appropriate promoter, 
such as the phage lambda PL promoter and the E. coli lac, trp and tac promoters, to 
name a few. Other suitable promoters will be known to the skilled artisan. The 
expression constructs will further contain sites for transcription initiation, 
termination and, in the transcribed region, a ribosome binding site for translation. 
The coding portion of the mature transcripts expressed by the constructs will 
preferably include a translation initiating codon at the beginning and a termination 
codon (UAA, UGA or UAG) appropriately positioned at the end of the 
polypeptide to be translated. 
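
A simple sanity check of a coding insert against these requirements might look like the following Python sketch. It assumes ATG as the initiating codon (bacteria may also use alternatives such as GTG or TTG), works on the DNA coding strand, so that the stop codons appear as TAA, TGA and TAG, and uses a hypothetical function name and test sequences.

```python
# DNA equivalents of the termination codons UAA, UGA and UAG.
STOP_CODONS = {"TAA", "TGA", "TAG"}


def check_coding_region(dna, start_codon="ATG"):
    """Return a list of problems found in a coding sequence (empty if none)."""
    dna = dna.upper()
    problems = []
    if len(dna) % 3 != 0:
        problems.append("length is not a multiple of three")
    if not dna.startswith(start_codon):
        problems.append("no initiating codon at the beginning")
    if dna[-3:] not in STOP_CODONS:
        problems.append("no termination codon at the end")
    # Look for premature in-frame stops between the first and last codons.
    internal = [i for i in range(3, len(dna) - 3, 3) if dna[i:i + 3] in STOP_CODONS]
    if internal:
        problems.append(f"in-frame stop codon(s) at offsets {internal}")
    return problems


# Hypothetical inserts (illustration only).
print(check_coding_region("ATGGCTGATGTCTAA"))
print(check_coding_region("GCTGATTGAGTCTAA"))
```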

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include but are not limited to kanamycin, 
chloramphenicol, tetracycline or ampicillin resistance genes for culturing in E. 
coli and other bacteria. Representative examples of appropriate hosts include, but 
are not limited to, bacterial cells, such as E. coli, Streptomyces and Salmonella 
typhimurium cells; fungal cells, such as yeast cells; insect cells such as 
Drosophila S2 and Spodoptera Sf9 cells. Appropriate culture mediums and 
conditions for the above-described host cells are known in the art. 

Known bacterial promoters suitable for use in the production of 
proteins of the present invention include the E. coli lacI and lacZ promoters, the 
T3 and T7 promoters, the gpt promoter, the lambda PR and PL promoters and the 
trp promoter. 

Thus, the present invention is also directed to expression vectors useful for 
the production of the proteins of the present invention. 

Vectors preferred for use in bacteria include pQE70, pQE60 and 
pQE-9, available from Qiagen; pBS vectors, Phagescript vectors, Bluescript 
vectors, pNH8A, pNH16a, pNH18A, pNH46A, available from Stratagene; and 
ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia. 
Other suitable vectors will be readily apparent to the skilled artisan. 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic 
lipid-mediated transfection, electroporation, transduction, infection or other 
methods. Such methods are described in many standard laboratory manuals, such 
as Davis et al., Basic Methods In Molecular Biology (1986). 

The polypeptide may be expressed in a modified form, such as a fusion 
protein, and may include not only secretion signals, but also additional 
heterologous functional regions. For instance, a region of additional amino acids, 
particularly charged amino acids, may be added to the N-terminus of the 
polypeptide to improve stability and persistence in the host cell, during 
purification, or during subsequent handling and storage. Also, peptide moieties 
may be added to the polypeptide to facilitate purification. Such regions may be 
removed prior to final preparation of the polypeptide. The addition of peptide 
moieties to polypeptides to engender secretion or excretion, to improve stability 
and to facilitate purification, among others, are familiar and routine techniques 
in the art. 

The Pgi protein can be recovered and purified from recombinant cell 
cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, 
affinity chromatography, hydroxylapatite chromatography and lectin 
chromatography. Most preferably, high performance liquid chromatography 
("HPLC") is employed for purification. Polypeptides of the present invention 
include naturally purified products, products of chemical synthetic procedures, 
and products produced by recombinant techniques from a prokaryotic or 
eukaryotic host, including, for example, bacterial, yeast, higher plant, insect and 
mammalian cells. Depending upon the host employed in a recombinant 
production procedure, the polypeptides of the present invention may be 
glycosylated or may be non-glycosylated. In addition, polypeptides of the 
invention may also include an initial modified methionine residue, in some cases 
as a result of host-mediated processes. 

Pgi Polypeptides and Fragments 

The invention further provides an isolated Pgi polypeptide having the 
amino acid sequence encoded by the deposited clone, or the amino acid sequence 
in Figure 1 (SEQ ID NO:2), or a peptide or polypeptide comprising a portion of 
the above polypeptides. 

It will be recognized in the art that some amino acid sequences of the Pgi 
polypeptide can be varied without significant effect on the structure or function 
of the protein. If such differences in sequence are contemplated, it should be 
remembered that there will be critical areas on the protein which determine 
activity. 

Thus, the invention further includes variations of the Pgi polypeptide 
which show substantial Pgi polypeptide activity or which include regions of Pgi 
protein such as the protein portions discussed below. Such mutants include 
deletions, insertions, inversions, repeats, and type substitutions. As indicated 
above, guidance concerning which amino acid changes are likely to be 
phenotypically silent can be found in Bowie, J.U., et al., "Deciphering the 
Message in Protein Sequences: Tolerance to Amino Acid Substitutions," Science 
247:1306-1310 (1990). 

Thus, the fragment, derivative or analog of the polypeptide of Figure 1 
(SEQ ID NO:2), or the Pgi polypeptide encoded by the deposited clone, may be 
(i) one in which one or more of the amino acid residues are substituted with a 
conserved or non-conserved amino acid residue (preferably a conserved amino 
acid residue) and such substituted amino acid residue may or may not be one 
encoded by the genetic code, or (ii) one in which one or more of the amino acid 
residues includes a substituent group, or (iii) one in which the polypeptide is 
fused with another compound, such as a compound to increase the half-life of the 
polypeptide (for example, polyethylene glycol), or (iv) one in which the additional 
amino acids are fused to the mature polypeptide, such as an IgG Fc fusion region 
peptide or leader or secretory sequence or a sequence which is employed for 
purification of the polypeptide or a proprotein sequence. Such fragments, 
derivatives and analogs are deemed to be within the scope of those skilled in the 
art from the teachings herein. 

As indicated, changes are preferably of a minor nature, such as 
conservative amino acid substitutions that do not significantly affect the folding 
or activity of the protein (see Table 1). 

TABLE 1. Conservative Amino Acid Substitutions. 

Aromatic        Phenylalanine 
                Tryptophan 
                Tyrosine 

Hydrophobic     Leucine 
                Isoleucine 
                Valine 

Polar           Glutamine 
                Asparagine 

Basic           Arginine 
                Lysine 
                Histidine 

Acidic          Aspartic Acid 
                Glutamic Acid 

Small           Alanine 
                Serine 
                Threonine 
                Methionine 
                Glycine 

Of course, the number of amino acid substitutions a skilled artisan would 
make depends on many factors, including those described above. Generally 
speaking, the number of amino acid substitutions for any given Pgi polypeptide 
will not be more than 50, 40, 30, 20, 10, 5, or 3. 
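
The groupings of Table 1 lend themselves to a simple lookup. The Python sketch below encodes the six groups using standard one-letter amino acid codes and classifies each difference between a reference and a variant sequence as conservative (same group) or not. The function name and the two 8-residue sequences are hypothetical, and residues absent from Table 1 (such as cysteine and proline) are never treated as conservative here.

```python
# Table 1 groups, written with standard one-letter amino acid codes.
CONSERVATIVE_GROUPS = {
    "aromatic": {"F", "W", "Y"},
    "hydrophobic": {"L", "I", "V"},
    "polar": {"Q", "N"},
    "basic": {"R", "K", "H"},
    "acidic": {"D", "E"},
    "small": {"A", "S", "T", "M", "G"},
}
GROUP_OF = {
    aa: name for name, members in CONSERVATIVE_GROUPS.items() for aa in members
}


def classify_substitutions(reference, variant):
    """Split substitutions into (conservative, other) lists of (position, from, to)."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have equal length")
    conservative, other = [], []
    for pos, (a, b) in enumerate(zip(reference, variant), start=1):
        if a == b:
            continue
        same_group = GROUP_OF.get(a) is not None and GROUP_OF.get(a) == GROUP_OF.get(b)
        (conservative if same_group else other).append((pos, a, b))
    return conservative, other


# Hypothetical sequences (illustration only).
conservative, other = classify_substitutions("MADVHEAL", "MSDIHDAW")
print("conservative:", conservative)
print("other       :", other)
print("total       :", len(conservative) + len(other))
```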

Amino acids in the Pgi protein of the present invention that are essential 
for function can be identified by methods known in the art, such as site-directed 
mutagenesis or alanine-scanning mutagenesis (Cunningham and Wells, Science 
244:1081-1085 (1989)). The latter procedure introduces single alanine mutations 
at every residue in the molecule. The resulting mutant molecules are then tested 
for phosphoglucose isomerase activity. 

The polypeptides of the present invention are preferably provided in an 
isolated form, and preferably are substantially purified. By "isolated polypeptide" 
is intended a polypeptide removed from its native environment. Thus, a 
polypeptide produced and/or contained within a recombinant host cell is 
considered isolated for purposes of the present invention. Also intended as an 
"isolated polypeptide" are polypeptides that have been purified, partially or 
substantially, from a recombinant host cell. For example, a recombinantly 
produced version of the Pgi polypeptide can be substantially purified by the 
one-step method described in Smith and Johnson, Gene 67:31-40 (1988). 

The polypeptides of the present invention include the Pgi polypeptide 
encoded by the deposited DNA and polypeptides which are at least 95% identical, 
still more preferably at least 96%, 97%, 98% or 99% identical to the polypeptides 
encoded by the deposited clone, to the polypeptide of Figure 1 (SEQ ID NO:2), 
and also include portions of such polypeptides with at least 30 amino acids and 
more preferably at least 50 amino acids. 

By a polypeptide having an amino acid sequence at least, for example, 
95% "identical" to a reference amino acid sequence of a Pgi polypeptide is 
intended that the amino acid sequence of the polypeptide is identical to the 
reference sequence except that the polypeptide sequence may include up to five 
amino acid alterations per each 100 amino acids of the reference amino acid 
sequence of the Pgi polypeptide. In other words, to obtain a polypeptide having an amino acid 
sequence at least 95% identical to a reference amino acid sequence, up to 5% of 
the amino acid residues in the reference sequence may be deleted or substituted 
with another amino acid, or a number of amino acids up to 5% of the total amino 
acid residues in the reference sequence may be inserted into the reference 
sequence. These alterations of the reference sequence may occur at the amino or 
carboxy terminal positions of the reference amino acid sequence or anywhere 
between those terminal positions, interspersed either individually among residues 
in the reference sequence or in one or more contiguous groups within the 
reference sequence. 

As a practical matter, whether any particular polypeptide is at least 95%, 
96%, 97%, 98% or 99% identical to, for instance, the amino acid sequence shown 
in Figure 1 (SEQ ID NO:2) or to the amino acid sequence encoded by the deposited 
clone can be determined conventionally using known computer programs such 
as the Bestfit program (Wisconsin Sequence Analysis Package, Version 8 for Unix, 
Genetics Computer Group, University Research Park, 575 Science Drive, 
Madison, WI 53711). When using Bestfit or any other sequence alignment 
program to determine whether a particular sequence is, for instance, 95% 
identical to a reference sequence according to the present invention, the 
parameters are set, of course, such that the percentage of identity is calculated 
over the full length of the reference amino acid sequence and that gaps in 
homology of up to 5% of the total number of amino acid residues in the reference 
sequence are allowed. 

The polypeptide of the present invention could be used as a molecular 
weight marker on SDS-PAGE gels or on molecular sieve gel filtration columns 
using methods well known to those of skill in the art. 

N-terminal and C-terminal Deletion Mutants 

In one embodiment, the present invention provides polypeptides having 
one or more residues deleted from the amino terminus of the amino acid sequence 
of the Pgi polypeptide depicted in Figure 1 or encoded by the DNA of the 
deposited clone. Particularly, in one embodiment, N-terminal deletions of the Pgi 
polypeptide can be described by the general formula m to 540, where m is any 
one of the integers from 2 to 539 corresponding to the position of the amino acid 
residue identified in SEQ ID NO:2 and, preferably, corresponds to one of the 
N-terminal amino acid residues identified in the N-terminal deletions specified 
herein. Polynucleotides encoding these polypeptides are also encompassed by the 
invention. 

Further embodiments of the invention are directed to C-terminal deletions 
of the Pgi polypeptides of the invention, described by the general formula 1 to n, 
where n is any one of the integers from 2 to 539 corresponding to the position of 
amino acid residue identified in SEQ ID NO:2, and preferably corresponds to a 
residue identified in one of the C-terminal deletions specified herein. 
Polynucleotides encoding these polypeptides are also encompassed by the 
invention. 

Further embodiments of the invention are directed to polypeptide 
fragments comprising, or alternatively, consisting of, amino acid residues 
described by the general formula m to n, where m and n correspond to any one 
of the amino acid residues specified above for these symbols, respectively. 
Polynucleotides encoding these polypeptides are also encompassed by the 
invention. 
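
The "m to 540", "1 to n" and "m to n" formulas map directly onto sequence slicing. The following Python sketch uses 1-based residue positions as in the text; the function names are assumptions, and the 540-residue string is a placeholder rather than the Pgi sequence of SEQ ID NO:2.

```python
def n_terminal_deletion(protein, m):
    """Residues m through the last residue (the 'm to 540' form for a 540-mer)."""
    if not 2 <= m <= len(protein) - 1:
        raise ValueError("m must lie between 2 and len(protein) - 1")
    return protein[m - 1:]


def c_terminal_deletion(protein, n):
    """Residues 1 through n (the '1 to n' form)."""
    if not 2 <= n <= len(protein) - 1:
        raise ValueError("n must lie between 2 and len(protein) - 1")
    return protein[:n]


def fragment(protein, m, n):
    """Residues m through n inclusive (the 'm to n' form)."""
    if not 1 <= m <= n <= len(protein):
        raise ValueError("require 1 <= m <= n <= len(protein)")
    return protein[m - 1:n]


# Placeholder 540-residue sequence; NOT the actual Pgi sequence.
placeholder = "M" + "A" * 539

print(len(n_terminal_deletion(placeholder, 2)))    # residues 2 to 540
print(len(c_terminal_deletion(placeholder, 539)))  # residues 1 to 539
print(len(fragment(placeholder, 10, 20)))          # residues 10 to 20
```

With m running from 2 to 539, the N-terminal formula alone describes 538 distinct deletion mutants of a 540-residue polypeptide.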

The following examples are illustrative only and are not intended to limit 
the scope of the invention as defined by the appended claims. It will be apparent 
to those skilled in the art that various modifications and variations can be made 
in the methods of the present invention without departing from the spirit and 
scope of the invention. Thus, it is intended that the present invention cover the 
modifications and variations of this invention provided they come within the 
scope of the appended claims and their equivalents. 

Examples 

Example 1 - DNA Isolation and Purification 

DNA was isolated from cultures of NRRL B-11474 cells. NRRL B-11474 
cells were harvested from CM media (Table B) and suspended in 10 ml of TE pH 
8 (10 mM Tris·Cl, 1 mM EDTA). Forty micrograms of RNase A and 10 
milligrams of lysozyme were added per milliliter of suspension and the 
suspension was incubated at 37°C for 30 minutes. The suspension was made 
1.0% in sodium dodecyl sulfate (SDS) and 0.1 mg/l proteinase K was added, and 
the cells were lysed by incubation at 37°C for 10 minutes. Nucleic acids were 
purified by three extractions with TE-saturated phenol (pH 7), followed by 
ethanol precipitation. Nucleic acid precipitates were twice washed with 80% 
ethanol and redissolved in TE pH 8. The concentrations of DNA were quantified 
spectrophotometrically at 260 nm. The purity of the DNA preparations was determined 
spectrophotometrically (A260/A280 and A260/A230 ratios) and by agarose gel 
electrophoresis (0.8% agarose in 1x TAE). 
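
The spectrophotometric quantification can be sketched as follows. The conversion factor (an A260 of 1.0 corresponding to roughly 50 µg/ml of double-stranded DNA at a 1 cm path length) is the common rule of thumb and is an assumption here, as are the function names and the absorbance readings, none of which come from this example.

```python
# Rule-of-thumb conversion: A260 of 1.0 is about 50 ug/ml dsDNA (1 cm path).
DSDNA_UG_PER_ML_PER_A260 = 50.0


def dsdna_concentration_ug_per_ml(a260, dilution_factor=1.0, path_length_cm=1.0):
    """Estimate the dsDNA concentration of the undiluted sample from A260."""
    return a260 * DSDNA_UG_PER_ML_PER_A260 * dilution_factor / path_length_cm


def purity_ratios(a230, a260, a280):
    """Return (A260/A280, A260/A230), the two purity ratios named in the text."""
    return a260 / a280, a260 / a230


# Hypothetical readings for a sample diluted 1:20 before measurement.
a230, a260, a280 = 0.12, 0.25, 0.135
conc = dsdna_concentration_ug_per_ml(a260, dilution_factor=20)
r280, r230 = purity_ratios(a230, a260, a280)

print(f"concentration: {conc:.0f} ug/ml")
print(f"A260/A280: {r280:.2f}  A260/A230: {r230:.2f}")
```

As a rough guide, A260/A280 values near 1.8 and A260/A230 values of about 2.0 or above are usually taken to indicate little protein or phenol carry-over.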

Sequencing of genomic DNA was performed, as is known by one of 
ordinary skill in the art, by creating libraries of plasmids and cosmids using 
pGEM3 and Lorist 6, respectively. The C. glutamicum pgi gene was identified 
by homology to glucose-6-phosphate isomerase of Mycobacterium tuberculosis 
(Swiss Prot Accession number P77895, Swiss Prot ID G6PI_MYCTU). 

Example 2 - Increasing NADPH Availability by Disrupting pgi 

An increase in carbon flux through the oxidative branch of the pentose 
phosphate pathway was achieved by disrupting the pgi gene, which encodes 
6-phosphoglucose isomerase. Two PCR primers were designed from the genomic 
DNA sequence described above to facilitate the amplification of a 680 bp internal 
region of the C. glutamicum pgi gene. These primers were: 

pgif* (SEQ ID NO:3) 5' gctgatgtccacgaagctttgggac 3' 

pgir* (SEQ ID NO:4) 5' gctgagaaccttggaataaggtagg 3' 

Primers pgif* and pgir* contain the recognition sequence for the 
restriction enzyme Hind III. In the case of pgir*, it was necessary to make three 
changes from the C. glutamicum nucleotide sequence to incorporate the Hind III 
recognition sequence. These Hind III restriction sites facilitated subcloning. 

PCR amplification conditions were employed as follows. The final 
volume of each PCR reaction was 100 µl. 100 ng of each primer was used along 
with 50 ng of high molecular weight C. glutamicum ATCC 21799 genomic DNA 
and 2.5 units of Taq DNA polymerase. Reaction buffer was included at a 
concentration recommended by the manufacturer (Stratagene) and dNTPs were 
also included at a final concentration of 200 µM. Cycling parameters were as 
follows: 94 °C for 1 minute, followed by 94 °C for 30 seconds, 60 °C for 30 
seconds and 72 °C for 1 minute (30 cycles), 72 °C for 7 minutes followed by 
refrigeration. 
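
The cycling program above can be written down as data and its total hold time computed, as in this Python sketch. The variable and function names are illustrative, and the figure excludes temperature ramp times, which depend on the thermal cycler.

```python
# Cycling parameters as stated in the text (temperatures in deg C, times in s).
INITIAL_DENATURATION = (94, 60)
CYCLE_STEPS = [
    ("denaturation", 94, 30),
    ("annealing", 60, 30),
    ("extension", 72, 60),
]
CYCLES = 30
FINAL_EXTENSION = (72, 7 * 60)


def total_hold_time_s():
    """Sum of all programmed hold times, excluding ramping between steps."""
    per_cycle = sum(seconds for _, _, seconds in CYCLE_STEPS)
    return INITIAL_DENATURATION[1] + CYCLES * per_cycle + FINAL_EXTENSION[1]


seconds = total_hold_time_s()
print(f"{seconds} s = {seconds / 60:.0f} min before the refrigeration hold")
```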

On restriction with Hind III, the PCR product was reduced in size to 
approximately 660 bp. This fragment was then subcloned into the suicide vector 
pBGS131. The resulting subclone was designated pDPTpgi2. 

Following electrotransformation into competent C. glutamicum (NRRL 
B-11474) cells, integrants were selected for on CM (Table B) agar plates 
containing kanamycin at a final concentration of 10 ng/ml. Enzyme assay 
confirmed the absence of phosphoglucose isomerase activity in the mutant strains, 
indicating that the pgi gene in these strains had been disrupted. 

Shake flask experiments indicate that the C. glutamicum (NRRL B-11474) 
pgi mutants have improved lysine titers and yields when compared to C. 
glutamicum (NRRL B-11474) (Table A). 

Table A: Lysine production on FM3 (Table C) media 

Strain                  Growth   Titer   Yield 

NRRL B-11474              46       25      42 

NRRL B-11474::pgi2A       40       31      52 

Growth = optical density at 660 nm 

Titer = grams of lysine / liter of medium 

Yield = (grams of lysine / grams of glucose consumed) * 100 
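
The figures in Table A can be compared directly. The following Python sketch, offered as an illustration only, computes the relative change in titer and yield for the pgi mutant and, by rearranging the yield definition, an estimate of the glucose consumed per liter. It assumes that titer and glucose consumption are expressed on the same per-liter basis; the function names are hypothetical.

```python
TABLE_A = {
    "NRRL B-11474": {"growth": 46, "titer": 25, "yield": 42},
    "NRRL B-11474::pgi2A": {"growth": 40, "titer": 31, "yield": 52},
}


def relative_change(new, old):
    """Percent change of new relative to old."""
    return (new - old) / old * 100.0


def glucose_consumed(titer, yield_pct):
    """Rearrange Yield = titer / glucose * 100 to estimate g glucose per liter."""
    return titer / (yield_pct / 100.0)


parent = TABLE_A["NRRL B-11474"]
mutant = TABLE_A["NRRL B-11474::pgi2A"]

print("titer change (%):", round(relative_change(mutant["titer"], parent["titer"]), 1))
print("yield change (%):", round(relative_change(mutant["yield"], parent["yield"]), 1))
for strain, row in TABLE_A.items():
    print(strain, "glucose consumed (g/L):",
          round(glucose_consumed(row["titer"], row["yield"]), 1))
```

On these figures the mutant titer is 24% higher and the yield about 23.8% higher than the parent, while the estimated glucose consumed is nearly the same for both strains (about 59.5 and 59.6 g/L).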



Table B: CM Media 

Volume: 1000 ml    % Agar: 0 

Sucrose              50 g 
KH2PO4               0.5 g 
K2HPO4               1.5 g 
Urea                 3 g 
MgSO4 * 7H2O         0.5 g 
Polypeptone          20 g 
Beef Extract         5 g 
Biotin               12.5 ml (60 mg/L) 
Thiamine             25 ml (120 mg/L) 
Niacinamide          25 ml (5 g/L) 
L-Methionine         0.5 g 
L-Threonine          0.25 g 
L-Alanine            0.5 g 

Bring to volume: 1000 ml with DI water. 

pH: about 7.1 
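
As a worked illustration, the final vitamin concentrations in CM media can be estimated from Table B. The sketch below assumes that the value in parentheses after each vitamin is the concentration of the stock solution added, which the table does not state explicitly; the function name is hypothetical.

```python
FINAL_VOLUME_ML = 1000.0  # Table B: medium is brought to 1000 ml

# vitamin: (volume added in ml, ASSUMED stock concentration in mg/L)
VITAMINS = {
    "Biotin": (12.5, 60.0),
    "Thiamine": (25.0, 120.0),
    "Niacinamide": (25.0, 5000.0),  # 5 g/L expressed in mg/L
}


def final_mg_per_l(volume_ml, stock_mg_per_l, final_volume_ml=FINAL_VOLUME_ML):
    """Dilution: final concentration = stock concentration * added / final volume."""
    return stock_mg_per_l * volume_ml / final_volume_ml


for name, (volume_ml, stock) in VITAMINS.items():
    print(f"{name}: {final_mg_per_l(volume_ml, stock):g} mg/L")
```

Under that assumption the medium would contain 0.75 mg/L biotin, 3 mg/L thiamine and 125 mg/L niacinamide.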



Table C: FM3 Media 

Per liter 

(NH4)2SO4            50 g 
KH2PO4               1 g 
MgSO4 * 7H2O         0.4 g 
MnSO4 * H2O          0.01 g 
FeSO4 * 7H2O         0.01 g 
Biotin               0.03 mg 
Corn Steep Liquor    4% dry solids final concentration 
Glucose              6% final concentration 
CaCO3                50 g 



Example 3 - Disrupting the Gene Encoding 6-Phosphofructokinase (pfkA) 

The gene encoding 6-phosphofructokinase (pfkA) was disrupted by a 
method similar to that described for the pgi gene in Example 2. Disruption of the 
pfkA gene was verified by enzyme assay of extracts of the mutants, which showed 
that 6-phosphofructokinase activity was lacking. Unexpectedly, the C. 
glutamicum (NRRL B-11474) pfkA mutants were unable to utilize glucose. 

***** 

All patents and publications referred to herein are hereby expressly 
incorporated in their entirety by reference. 

While the foregoing invention has been described in some detail for 
purposes of clarity and understanding, it will be appreciated by one skilled in the 
art from a reading of this disclosure that various changes in form and detail can 
be made without departing from the true scope of the invention and appended 
claims. 



INDICATIONS RELATING TO DEPOSITED MICROORGANISM 
OR OTHER BIOLOGICAL MATERIAL 

(PCT Rule 13bis) 

A. The indications made below relate to the microorganism referred to in the description on page 2. 

B. IDENTIFICATION OF DEPOSIT 

Name of depositary institution: Agricultural Research Culture Collection (NRRL) 

Address of depositary institution (including postal code and country): 
1815 N. University Street 
Peoria, Illinois 61604 
United States of America 

Date of deposit: August 17, 1999 

Accession Number: NRRL B-30174 

C. ADDITIONAL INDICATIONS (leave blank if not applicable) 

Escherichia coli DH5α p41-13(C01) 

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., 
"Accession Number of Deposit") 

This sheet was received with the international application. 



1. A method of producing L-amino acids comprising: 

culturing an altered bacterial cell having an increased amount of 
NADPH as compared to an unaltered bacterial cell, wherein L-amino acid yields 
from said altered bacterial cell are greater than yields from an unaltered bacterial 
cell. 

2. The method of claim 1, wherein said altered bacterial cell has 
increased carbon flux through the oxidative branch of the pentose phosphate 
pathway. 

3. The method of claim 2, wherein said altered bacterial cell has an 
increased amount of one or more enzymes selected from the group comprising 
glucose-6-phosphate dehydrogenase, lactonase and 6-phosphogluconate 
dehydrogenase. 

4. The method of claim 1, wherein said altered bacterial cell has a 
decreased carbon flux through the glycolytic pathway. 

5. The method of claim 4, wherein said altered bacterial cell has a 
decreased amount of 6-phosphoglucose isomerase enzymatic activity. 

6. The method of claim 1, wherein said L-amino acid yields from 
said altered bacterial cell are from about 1% to about 100% greater than from said 
unaltered bacterial cell. 

7. The method of claim 1, wherein said altered bacterial cell has a 
mutant pgi gene. 

8. The method of claim 1, wherein said altered bacterial cell is 
produced by 

(a) subcloning an internal region of a pgi gene; and 

(b) inserting said resulting vector from step (a) into a bacterial 
genome via homologous recombination. 

9. The method of claim 1, wherein said altered bacterial cell has an 
increased amount of malic enzyme. 

10. The method of claim 1, wherein said altered bacterial cell has an 
increased amount of isocitrate dehydrogenase. 

11. The method of claim 1, wherein said altered bacterial cell is a 
Corynebacterium glutamicum cell. 

12. The method of claim 11, wherein said Corynebacterium 
glutamicum cell has a gene selected from the group consisting of a mutated pgi 
gene. 

13. The method of claim 1, wherein said L-amino acid comprises 
L-lysine. 

14. A vector comprising pDPTpgi2. 

15. A method of producing a bacterial cell with a mutated pgi gene 
comprising: 

(a) subcloning an internal region of the pgi gene into a suicide 
vector; and 

(b) inserting said resulting vector from step (a) into a bacterial 
genome whereby a bacterial cell with an altered pgi gene 
is produced. 

16. The method of claim 15, wherein said suicide vector is selected 
from the group comprising pBGS131 and ColE1-based replicons with selectable 
marker. 

17. An altered bacterial cell produced according to the method of 
claim 15. 

18. A method of producing L-amino acids comprising: 

culturing an altered bacterial cell having a decreased amount of 
6-phosphoglucose isomerase enzymatic activity as compared to an unaltered 
bacterial cell, wherein L-amino acid yields from said altered bacterial cell are 
greater than yields from an unaltered bacterial cell. 

19. The method of claim 18, wherein said L-amino acid yields from 
said altered bacterial cell are from about 1% to about 100% greater than from said 
unaltered bacterial cell. 

20. The method of claim 18, wherein said altered bacterial cell has a 
mutant pgi gene. 

21. The method of claim 18, wherein said altered bacterial cell is 
produced by 

(a) subcloning an internal region of a pgi gene; and 

(b) inserting said resulting vector from step (a) into a bacterial 
genome via homologous recombination. 

22. The method of claim 18, wherein said altered bacterial cell is a 
Corynebacterium glutamicum cell. 

23. The method of claim 18, wherein said L-amino acid comprises 
L-lysine. 


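     ATGGCGGACATTTCGACCACCCAGGCTTGGCAAGACCTGACCGATCATTACTCAAACTTC
   1 ---------+---------+---------+---------+---------+---------+ 60
      M  A  D  I  S  T  T  Q  A  W  Q  D  L  T  D  H  Y  S  N  F

     CAGGCAACCACTCTGCGTGAACTTTTCAAGGAAGAAAACCGCGCCGAGAAGTACACCTTC
  61 ---------+---------+---------+---------+---------+---------+ 120
      Q  A  T  T  L  R  E  L  F  K  E  E  N  R  A  E  K  Y  T  F

     TCCGCGGCTGGCCTCCACGTCGACCTGTCGAAGAATCTGCTTGACGACGCCACCCTCACC
 121 ---------+---------+---------+---------+---------+---------+ 180
      S  A  A  G  L  H  V  D  L  S  K  N  L  L  D  D  A  T  L  T

     AAGCTCCTTGCACTGACCGAAGAATCTGGCCTTCGCGAACGCATTGACGCGATGTTTGCC
 181 ---------+---------+---------+---------+---------+---------+ 240
      K  L  L  A  L  T  E  E  S  G  L  R  E  R  I  D  A  M  F  A

     GGTGAACACCTCAACAACACCGAAGACCGCGCTGTCCTCCACACCGCGCTGCGCCTTCCT
 241 ---------+---------+---------+---------+---------+---------+ 300
      G  E  H  L  N  N  T  E  D  R  A  V  L  H  T  A  L  R  L  P

     CCCGAAGCTGATCTGTCAGTAGATGGCCAAGATGTTGCTGCTGATGTCCACGAAGTTTTG
 301 ---------+---------+---------+---------+---------+---------+ 360
      P  E  A  D  L  S  V  D  G  Q  D  V  A  A  D  V  H  E  V  L

     GGACGCATGCGTGACTTCGCTACTGCGCTGCGCTCAGGCAACTGGTTGGGACACACCGGC
 361 ---------+---------+---------+---------+---------+---------+ 420
      G  R  M  R  D  F  A  T  A  L  R  S  G  N  W  L  G  H  T  G

     CACACGATCAAGAAGATCGTCAACATTGGTATCGGTGGCTCTGACCTCGGACCAGCCATG
 421 ---------+---------+---------+---------+---------+---------+ 480
      H  T  I  K  K  I  V  N  I  G  I  G  G  S  D  L  G  P  A  M

     GCTACGAAGGCTCTGCGTGCATACGCGACCGCTGGTATCTCAGCAGAATTCGTCTCCAAC
 481 ---------+---------+---------+---------+---------+---------+ 540
      A  T  K  A  L  R  A  Y  A  T  A  G  I  S  A  E  F  V  S  N

     GTCGACCCAGCAGACCTCGTTTCTGTGTTGGAAGACCTCGATGCAGAATCCACATTGTTC
 541 ---------+---------+---------+---------+---------+---------+ 600
      V  D  P  A  D  L  V  S  V  L  E  D  L  D  A  E  S  T  L  F

FIG. 1A 

     GTGATCGCTTCGAAAACTTTTACCACCCAGGAGACGCTGTCTAACGCTCGTGCAGCTCGT
 601 ---------+---------+---------+---------+---------+---------+ 660
      V  I  A  S  K  T  F  T  T  Q  E  T  L  S  N  A  R  A  A  R

     GCTTGGCTGGTAGAGAAGCTCGGTGAAGAGGCTGTCGCGAAGCATTTCGTCGCAGTGTCC
 661 ---------+---------+---------+---------+---------+---------+ 720
      A  W  L  V  E  K  L  G  E  E  A  V  A  K  H  F  V  A  V  S

     ACCAATGCTGAAAAGGTCGCAGAGTTCGGTATCGACACGGACAACATGTTCGGCTTCTGG
 721 ---------+---------+---------+---------+---------+---------+ 780
      T  N  A  E  K  V  A  E  F  G  I  D  T  D  N  M  F  G  F  W

     GACTGGGTCGGAGGTCGTTACTCCGTGGACTCCGCAGTTGGTCTTTCCCTCATGGCAGTG
 781 ---------+---------+---------+---------+---------+---------+ 840
      D  W  V  G  G  R  Y  S  V  D  S  A  V  G  L  S  L  M  A  V

     ATCGGCCCTCGCGACTTCATGCGTTTCCTCGGTGGATTCCACGCGATGGATGAACACTTC
 841 ---------+---------+---------+---------+---------+---------+ 900
      I  G  P  R  D  F  M  R  F  L  G  G  F  H  A  M  D  E  H  F

     CGCACCACCAAGTTCGAAGAGAACGTTCCAATCTTGATGGCTCTGCTCGGTGTCTGGTAC
 901 ---------+---------+---------+---------+---------+---------+ 960
      R  T  T  K  F  E  E  N  V  P  I  L  M  A  L  L  G  V  W  Y

     TCCGATTTCTATGGTGCAGAAACCCACGCTGTCCTACCTTATTCCGAGGATCTCAGCCGT
 961 ---------+---------+---------+---------+---------+---------+ 1020
      S  D  F  Y  G  A  E  T  H  A  V  L  P  Y  S  E  D  L  S  R

     TTTGCTGCTTACCTCCAGCAGCTGACCATGGAATCAAACGGGAAGTCAGTCCACCGCGAC
1021 ---------+---------+---------+---------+---------+---------+ 1080
      F  A  A  Y  L  Q  Q  L  T  M  E  S  N  G  K  S  V  H  R  D

     GGCTCCCCTGTTTCCACTGGCACTGGCGAAATTTACTGGGGTGAGCCTGGCACAAATGGC
1081 ---------+---------+---------+---------+---------+---------+ 1140
      G  S  P  V  S  T  G  T  G  E  I  Y  W  G  E  P  G  T  N  G

     CAGCACGCTTTCTTCCAGCTGATCCACCAGGGCACTCGCCTTGTTCCAGCTGATTTCATT
1141 ---------+---------+---------+---------+---------+---------+ 1200
      Q  H  A  F  F  Q  L  I  H  Q  G  T  R  L  V  P  A  D  F  I

FIG. 1B 

     GGTTTCGCTCGTCCAAAGCAGGATCTTCCTGCCGGTGAGCGCACCATGCATGACCTTTTG
1201 ---------+---------+---------+---------+---------+---------+ 1260
      G  F  A  R  P  K  Q  D  L  P  A  G  E  R  T  M  H  D  L  L

     ATGAGCAACTTCTTCGCACAGACCAAGGTTTTGGCTTTCGGTAAGAACGCTGAAGAGATC
1261 ---------+---------+---------+---------+---------+---------+ 1320
      M  S  N  F  F  A  Q  T  K  V  L  A  F  G  K  N  A  E  E  I

     GCTGCGGAAGGTGTCGCACCTGAGCTGGTCAACCACAAGGTCATGCCAGGTAATCGCCCA
1321 ---------+---------+---------+---------+---------+---------+ 1380
      A  A  E  G  V  A  P  E  L  V  N  H  K  V  M  P  G  N  R  P

     ACCACCACCATTTTGGCGGAGGAACTTACCCCTTCTATTCTCGGTGCGTTGATCGCTTTG
1381 ---------+---------+---------+---------+---------+---------+ 1440
      T  T  T  I  L  A  E  E  L  T  P  S  I  L  G  A  L  I  A  L

     TACGAACACATCGTGATGGTTCAGGGCGTGATTTGGGACATCAACTCCTTCGACCAATGG
1441 ---------+---------+---------+---------+---------+---------+ 1500
      Y  E  H  I  V  M  V  Q  G  V  I  W  D  I  N  S  F  D  Q  W

     GGTGTTGAACTGGGCAAACAGCAGGCAAATGACCTCGCTCCGGCTGTCTCTGGTGAAGAG
1501 ---------+---------+---------+---------+---------+---------+ 1560
      G  V  E  L  G  K  Q  Q  A  N  D  L  A  P  A  V  S  G  E  E

     GATGTTGACTCGGGAGATTCTTCCACTGATTCACTGATTAAGTGGTACCGCGCAAATAGG
1561 ---------+---------+---------+---------+---------+---------+ 1620
      D  V  D  S  G  D  S  S  T  D  S  L  I  K  W  Y  R  A  N  R

     TAG
1621 --- 1623
      *

FIG. 1C 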

SEQUENCE LISTING 

<110> O'Donohue, Michael R. 
      Hanke, Paul D. 

<120> Methods for Producing L-Amino Acids 

<130> 1533.101PC02 

<140> 
<141> 

<150> US 60/150,017 
<151> 1999-08-20 

<150> US 60/145,217 
<151> 1999-07-23 

<160> 4 

<170> PatentIn Ver. 2.1 

<210> 1 
<211> 1623 
<212> DNA 
<213> Corynebacterium glutamicum 

<220> 
<221> CDS 
<222> (1)..(1620) 

<400> 1 

atg gcg gac att tcg acc acc cag gct tgg caa gac ctg acc gat cat     48
Met Ala Asp Ile Ser Thr Thr Gln Ala Trp Gln Asp Leu Thr Asp His
1               5                   10                  15

tac tca aac ttc cag gca acc act ctg cgt gaa ctt ttc aag gaa gaa     96
Tyr Ser Asn Phe Gln Ala Thr Thr Leu Arg Glu Leu Phe Lys Glu Glu
            20                  25                  30

aac cgc gcc gag aag tac acc ttc tcc gcg gct ggc ctc cac gtc gac    144
Asn Arg Ala Glu Lys Tyr Thr Phe Ser Ala Ala Gly Leu His Val Asp
        35                  40                  45

ctg tcg aag aat ctg ctt gac gac gcc acc ctc acc aag ctc ctt gca    192
Leu Ser Lys Asn Leu Leu Asp Asp Ala Thr Leu Thr Lys Leu Leu Ala
    50                  55                  60

ctg acc gaa gaa tct ggc ctt cgc gaa cgc att gac gcg atg ttt gcc    240
Leu Thr Glu Glu Ser Gly Leu Arg Glu Arg Ile Asp Ala Met Phe Ala
65                  70                  75                  80

ggt gaa cac ctc aac aac acc gaa gac cgc gct gtc ctc cac acc gcg    288
Gly Glu His Leu Asn Asn Thr Glu Asp Arg Ala Val Leu His Thr Ala
                85                  90                  95

ctg cgc ctt cct ccc gaa gct gat ctg tca gta gat ggc caa gat gtt    336
Leu Arg Leu Pro Pro Glu Ala Asp Leu Ser Val Asp Gly Gln Asp Val
            100                 105                 110

gct gct gat gtc cac gaa gtt ttg gga cgc atg cgt gac ttc gct act    384
Ala Ala Asp Val His Glu Val Leu Gly Arg Met Arg Asp Phe Ala Thr
        115                 120                 125

gcg ctg cgc tca ggc aac tgg ttg gga cac acc ggc cac acg atc aag    432
Ala Leu Arg Ser Gly Asn Trp Leu Gly His Thr Gly His Thr Ile Lys
    130                 135                 140

aag atc gtc aac att ggt atc ggt ggc tct gac ctc gga cca gcc atg    480
Lys Ile Val Asn Ile Gly Ile Gly Gly Ser Asp Leu Gly Pro Ala Met
145                 150                 155                 160

gct acg aag gct ctg cgt gca tac gcg acc gct ggt atc tca gca gaa    528
Ala Thr Lys Ala Leu Arg Ala Tyr Ala Thr Ala Gly Ile Ser Ala Glu
                165                 170                 175

ttc gtc tcc aac gtc gac cca gca gac ctc gtt tct gtg ttg gaa gac    576
Phe Val Ser Asn Val Asp Pro Ala Asp Leu Val Ser Val Leu Glu Asp
            180                 185                 190

ctc gat gca gaa tcc aca ttg ttc gtg atc gct tcg aaa act ttt acc    624
Leu Asp Ala Glu Ser Thr Leu Phe Val Ile Ala Ser Lys Thr Phe Thr
        195                 200                 205

acc cag gag acg ctg tct aac gct cgt gca gct cgt gct tgg ctg gta    672
Thr Gln Glu Thr Leu Ser Asn Ala Arg Ala Ala Arg Ala Trp Leu Val
    210                 215                 220

gag aag ctc ggt gaa gag gct gtc gcg aag cat ttc gtc gca gtg tcc    720
Glu Lys Leu Gly Glu Glu Ala Val Ala Lys His Phe Val Ala Val Ser
225                 230                 235                 240

acc aat gct gaa aag gtc gca gag ttc ggt atc gac acg gac aac atg    768
Thr Asn Ala Glu Lys Val Ala Glu Phe Gly Ile Asp Thr Asp Asn Met
                245                 250                 255

ttc ggc ttc tgg gac tgg gtc gga ggt cgt tac tcc gtg gac tcc gca    816
Phe Gly Phe Trp Asp Trp Val Gly Gly Arg Tyr Ser Val Asp Ser Ala
            260                 265                 270

gtt ggt ctt tcc ctc atg gca gtg atc ggc cct cgc gac ttc atg cgt    864
Val Gly Leu Ser Leu Met Ala Val Ile Gly Pro Arg Asp Phe Met Arg
        275                 280                 285

ttc ctc ggt gga ttc cac gcg atg gat gaa cac ttc cgc acc acc aag    912
Phe Leu Gly Gly Phe His Ala Met Asp Glu His Phe Arg Thr Thr Lys
    290                 295                 300

ttc gaa gag aac gtt cca atc ttg atg gct ctg ctc ggt gtc tgg tac    960
Phe Glu Glu Asn Val Pro Ile Leu Met Ala Leu Leu Gly Val Trp Tyr
305                 310                 315                 320

tcc gat ttc tat ggt gca gaa acc cac gct gtc cta cct tat tcc gag   1008
Ser Asp Phe Tyr Gly Ala Glu Thr His Ala Val Leu Pro Tyr Ser Glu
                325                 330                 335

gat ctc agc cgt ttt gct gct tac ctc cag cag ctg acc atg gaa tca   1056
Asp Leu Ser Arg Phe Ala Ala Tyr Leu Gln Gln Leu Thr Met Glu Ser
            340                 345                 350

aac ggc aag tca gtc cac cgc gac ggc tcc cct gtt tcc act ggc act   1104
Asn Gly Lys Ser Val His Arg Asp Gly Ser Pro Val Ser Thr Gly Thr
        355                 360                 365

ggc gaa att tac tgg ggt gag cct ggc aca aat ggc cag cac gct ttc   1152
Gly Glu Ile Tyr Trp Gly Glu Pro Gly Thr Asn Gly Gln His Ala Phe
    370                 375                 380

ttc cag ctg atc cac cag ggc act cgc ctt gtt cca gct gat ttc att   1200
Phe Gln Leu Ile His Gln Gly Thr Arg Leu Val Pro Ala Asp Phe Ile
385                 390                 395                 400

ggt ttc gct cgt cca aag cag gat ctt cct gcc ggt gag cgc acc atg   1248
Gly Phe Ala Arg Pro Lys Gln Asp Leu Pro Ala Gly Glu Arg Thr Met
                405                 410                 415

cat gac ctt ttg atg agc aac ttc ttc gca cag acc aag gtt ttg gct   1296
His Asp Leu Leu Met Ser Asn Phe Phe Ala Gln Thr Lys Val Leu Ala
            420                 425                 430

ttc ggt aag aac gct gaa gag atc gct gcg gaa ggt gtc gca cct gag   1344
Phe Gly Lys Asn Ala Glu Glu Ile Ala Ala Glu Gly Val Ala Pro Glu
        435                 440                 445

ctg gtc aac cac aag gtc atg cca ggt aat cgc cca acc acc acc att   1392
Leu Val Asn His Lys Val Met Pro Gly Asn Arg Pro Thr Thr Thr Ile
    450                 455                 460

ttg gcg gag gaa ctt acc cct tct att ctc ggt gcg ttg atc gct ttg   1440
Leu Ala Glu Glu Leu Thr Pro Ser Ile Leu Gly Ala Leu Ile Ala Leu
465                 470                 475                 480

tac gaa cac atc gtg atg gtt cag ggc gtg att tgg gac atc aac tcc   1488
Tyr Glu His Ile Val Met Val Gln Gly Val Ile Trp Asp Ile Asn Ser
                485                 490                 495

ttc gac caa tgg ggt gtt gaa ctg ggc aaa cag cag gca aat gac ctc   1536
Phe Asp Gln Trp Gly Val Glu Leu Gly Lys Gln Gln Ala Asn Asp Leu
            500                 505                 510

gct ccg gct gtc tct ggt gaa gag gat gtt gac tcg gga gat tct tcc   1584
Ala Pro Ala Val Ser Gly Glu Glu Asp Val Asp Ser Gly Asp Ser Ser
        515                 520                 525

act gat tca ctg att aag tgg tac cgc gca aat agg tag               1623
Thr Asp Ser Leu Ile Lys Trp Tyr Arg Ala Asn Arg
    530                 535                 540
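
By way of illustration, the relationship between SEQ ID NO:1 and SEQ ID NO:2 can be checked with the standard genetic code. The following Python sketch, which is not part of the original listing, translates the first 48 and the last 9 nucleotides of SEQ ID NO:1; the helper name translate is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons enumerated in TCAG order (TTT, TTC, TTA, ...)
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper().replace(" ", "")
    if len(dna) % 3:
        raise ValueError("length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


first_48 = "atg gcg gac att tcg acc acc cag gct tgg caa gac ctg acc gat cat"
last_9 = "aat agg tag"

print(translate(first_48))  # residues 1 to 16 of SEQ ID NO:2, one-letter code
print(translate(last_9))    # residues 539 and 540, then the stop codon
print(1620 // 3)            # number of codons in the CDS (1)..(1620)
```

The CDS of 1620 nucleotides corresponds to the 540 residues of SEQ ID NO:2, and nucleotides 1621 to 1623 form the stop codon.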



<210> 2 
<211> 540 
<212> PRT 

<213> Corynebacterium glutamicum 
<400> 2 

Met Ala Asp Ile Ser Thr Thr Gln Ala Trp Gln Asp Leu Thr Asp His
1               5                   10                  15

Tyr Ser Asn Phe Gln Ala Thr Thr Leu Arg Glu Leu Phe Lys Glu Glu
            20                  25                  30

Asn Arg Ala Glu Lys Tyr Thr Phe Ser Ala Ala Gly Leu His Val Asp
        35                  40                  45

Leu Ser Lys Asn Leu Leu Asp Asp Ala Thr Leu Thr Lys Leu Leu Ala
    50                  55                  60

Leu Thr Glu Glu Ser Gly Leu Arg Glu Arg Ile Asp Ala Met Phe Ala
65                  70                  75                  80

Gly Glu His Leu Asn Asn Thr Glu Asp Arg Ala Val Leu His Thr Ala
                85                  90                  95

Leu Arg Leu Pro Pro Glu Ala Asp Leu Ser Val Asp Gly Gln Asp Val
            100                 105                 110

Ala Ala Asp Val His Glu Val Leu Gly Arg Met Arg Asp Phe Ala Thr
        115                 120                 125

Ala Leu Arg Ser Gly Asn Trp Leu Gly His Thr Gly His Thr Ile Lys
    130                 135                 140

Lys Ile Val Asn Ile Gly Ile Gly Gly Ser Asp Leu Gly Pro Ala Met
145                 150                 155                 160

Ala Thr Lys Ala Leu Arg Ala Tyr Ala Thr Ala Gly Ile Ser Ala Glu
                165                 170                 175

Phe Val Ser Asn Val Asp Pro Ala Asp Leu Val Ser Val Leu Glu Asp
            180                 185                 190

Leu Asp Ala Glu Ser Thr Leu Phe Val Ile Ala Ser Lys Thr Phe Thr
        195                 200                 205

Thr Gln Glu Thr Leu Ser Asn Ala Arg Ala Ala Arg Ala Trp Leu Val
    210                 215                 220

Glu Lys Leu Gly Glu Glu Ala Val Ala Lys His Phe Val Ala Val Ser
225                 230                 235                 240

Thr Asn Ala Glu Lys Val Ala Glu Phe Gly Ile Asp Thr Asp Asn Met
                245                 250                 255

Phe Gly Phe Trp Asp Trp Val Gly Gly Arg Tyr Ser Val Asp Ser Ala
            260                 265                 270

Val Gly Leu Ser Leu Met Ala Val Ile Gly Pro Arg Asp Phe Met Arg
        275                 280                 285

Phe Leu Gly Gly Phe His Ala Met Asp Glu His Phe Arg Thr Thr Lys
    290                 295                 300

Phe Glu Glu Asn Val Pro Ile Leu Met Ala Leu Leu Gly Val Trp Tyr
305                 310                 315                 320

Ser Asp Phe Tyr Gly Ala Glu Thr His Ala Val Leu Pro Tyr Ser Glu
                325                 330                 335

Asp Leu Ser Arg Phe Ala Ala Tyr Leu Gln Gln Leu Thr Met Glu Ser
            340                 345                 350

Asn Gly Lys Ser Val His Arg Asp Gly Ser Pro Val Ser Thr Gly Thr
        355                 360                 365

Gly Glu Ile Tyr Trp Gly Glu Pro Gly Thr Asn Gly Gln His Ala Phe
    370                 375                 380

Phe Gln Leu Ile His Gln Gly Thr Arg Leu Val Pro Ala Asp Phe Ile
385                 390                 395                 400

Gly Phe Ala Arg Pro Lys Gln Asp Leu Pro Ala Gly Glu Arg Thr Met
                405                 410                 415

His Asp Leu Leu Met Ser Asn Phe Phe Ala Gln Thr Lys Val Leu Ala
            420                 425                 430

Phe Gly Lys Asn Ala Glu Glu Ile Ala Ala Glu Gly Val Ala Pro Glu
        435                 440                 445

Leu Val Asn His Lys Val Met Pro Gly Asn Arg Pro Thr Thr Thr Ile
    450                 455                 460

Leu Ala Glu Glu Leu Thr Pro Ser Ile Leu Gly Ala Leu Ile Ala Leu
465                 470                 475                 480

Tyr Glu His Ile Val Met Val Gln Gly Val Ile Trp Asp Ile Asn Ser
                485                 490                 495

Phe Asp Gln Trp Gly Val Glu Leu Gly Lys Gln Gln Ala Asn Asp Leu
            500                 505                 510

Ala Pro Ala Val Ser Gly Glu Glu Asp Val Asp Ser Gly Asp Ser Ser
        515                 520                 525

Thr Asp Ser Leu Ile Lys Trp Tyr Arg Ala Asn Arg
    530                 535                 540



<210> 3 
<211> 25 
<212> DNA 

<213> Corynebacterium glutamicum 
<400> 3 

gctgatgtcc acgaagcttt gggac 25 



<210> 4 
<211> 25 
<212> DNA 

<213> Corynebacterium glutamicum 



<400> 4 

gctgagaacc ttggaataag gtagg 25 